Effective methods and strategies for massive small files processing based on Hadoop

Abstract

The Hadoop framework provides a powerful way to handle Big Data. Because Hadoop has inherent defects of high memory overhead and low computing performance when processing massive numbers of small files, we implement three methods and propose two strategies for solving the small files problem in this paper. First, we implement three methods, i.e., Hadoop Archives (HAR), Sequence Files (SF) and CombineFileInputFormat (CFIF), to compensate for the existing defects of Hadoop. Moreover, we propose two strategies to meet the actual needs of different users. Finally, we evaluate the efficiency of the implemented methods and the validity of the proposed strategies. The experimental results show that our methods and strategies can improve the efficiency of massive small files processing, thereby enhancing the overall performance of Hadoop. © 2014 ISSN 1881-803X.
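
As a rough illustration of the idea behind container-based approaches such as Sequence Files, the sketch below packs many small payloads into a single stream of key/value records (file name as key, file bytes as value) and reads them back. It is not the Hadoop SequenceFile on-disk format and not the authors' implementation; the function names `pack_records` and `unpack_records`, the length-prefixed record layout, and the sample file names are all assumptions made for illustration.

```python
import io
import struct

# Hypothetical record layout (an assumption, not Hadoop's format):
# 4-byte key length, 4-byte value length, key bytes, value bytes.
HEADER = ">II"
HEADER_SIZE = struct.calcsize(HEADER)


def pack_records(files, out):
    """Append each (name, payload) pair to `out` as one length-prefixed record."""
    for name, payload in files.items():
        key = name.encode("utf-8")
        out.write(struct.pack(HEADER, len(key), len(payload)))
        out.write(key)
        out.write(payload)


def unpack_records(src):
    """Yield (name, payload) pairs from a stream written by pack_records."""
    while True:
        header = src.read(HEADER_SIZE)
        if not header:
            break  # clean end of stream
        if len(header) != HEADER_SIZE:
            raise ValueError("truncated record header")
        key_len, val_len = struct.unpack(HEADER, header)
        name = src.read(key_len).decode("utf-8")
        payload = src.read(val_len)
        yield name, payload


# A thousand tiny in-memory "files" stand in for small files on disk.
small_files = {f"log-{i:04d}.txt": f"record {i}\n".encode("utf-8") for i in range(1000)}

container = io.BytesIO()
pack_records(small_files, container)

container.seek(0)
restored = dict(unpack_records(container))
```

In this sketch the thousand payloads end up in one container object, so a storage layer that keeps per-file metadata would have a single entry to track rather than a thousand, which is the kind of overhead the abstract refers to.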

Bibliographic details

Authors: Xia, D.; Wang, B.; Rong, Z.; Li, Y.; Zhang, Zili
Year: 2014
Original format: PDF
Language: English
